CDS
Accession Number | TCMCG075C02091 |
gbkey | CDS |
Protein Id | XP_017980197.1 |
Location | complement(join(12958629..12958878,12959320..12959826,12960451..12961118,12961676..12961774,12961897..12962126,12962209..12962260,12962439..12962625,12962709..12962921,12963045..12963214,12965885..12966160,12966428..12966919,12967667..12967942,12969011..12969406)) |
Gene | LOC18612437 |
GeneID | 18612437 |
Organism | Theobroma cacao |
Protein
Length | 1271 aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_018124708.1 |
Definition | PREDICTED: histidine kinase 2 [Theobroma cacao] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGAGTTGTTCTTCTGGAACTGGGAATTTTGTGAAGCTCTCAAGGCTCCTTGGGGAAATACGTAAGTGTGCTTTGGTCAAGATGTCTATGAACGGCAAGCTTTCTGGTTCTAATTGTAGATTATCAGCAAATTTCAGGCTGAAGAAGGCAAAAGAGACTATGCATGGGCCCAATTCTTTCAGGAAATGGAAGAGAAACCTTCTCTTTCTCTGGCTTTTAGGCTTTGTTTCGACAGGAATTATTTGGTTTTTCTTGAGTTTCAATAGTGTAGCTTCGGAGAGGAATGAGAAAAGTCCTGATTCTTGTGATGAGAAGGCAAGAATCTTGCTCCAACATTTCAATGTTAGCAAGAACCAGTTTCATGCTCTAGCTTCTTTCTTCTACGAATCAGATCAGATAAAATTCCTCGAATGTACCAGAGATTCAGGACCTAAAAAGCCATCAAGTGATGGTATTGCCTGTGCTCTTAAGGTACTGTGTTCAGAGCACCAAGACCTCAGAAAGCAGCAGATGTGGGTTGTAAGAAATACAGAACTTAAGGATCAATGCCCAGTCCAAGTTGAGAATATTCCCAGCGAGCATGACTTGTCATTGCTGGAGCACGATACCTTATCATTTGTCTCACAAATTGCAGTTTCATTAGTATCATGGGAGCATCACAGTGGTGGAAAGAACATCTCACAAAGAAGTGCACTAGGAGTCGAATCAAAAGACAATTGTGAGAACTTGTCATTTTGTATGGTGAAAGGATGTTGGTTGCTTCTTGTTGGAGTGATACTGAGCTGGAAGATTCCTGGAGTTCGTTTGAAGCTCTGGAGGAACAGAAAGAATGAGCCAGCTCTGCTGCAGCCTGTGGCTCAGCAACTACCGCTGCTGCTGCAACAGAAGCAGCAGCAAACCCAGAGCCCTCCTAAAGGTGCAGGGAAGTGGAGAAAGAAACTCTTAATAACATTTGTATTTGTGGGGATCTTTACATCCTTCTGGTTATTTTGGCATTTAAACCAAAAGATCATTTTAAGGAGAGAAGAGACACTTGCCAACATGTGTGATGAAAGAGCACGGATGTTGCAGGATCAGTTCAATGTTAGCATGAATCATGTTCATGCGTTGGCTATTCTCGTATCCACTTTTCACCATGGGAAGCATCCATCTGCTATTGATCAGAAAACATTTGGTGAATATACTGAAAGAACAGCTTTTGAGAGGCCACTTACTAGTGGTGTCGCTTATGCACTGAAAGTTCTTCACTCAGAGAGGGAGCAGTTTGAGAAGCAGCATGGATGGACAATAAAGAAAATGGAAACTGAGGACCAGACTTTGGTCCAAGATTGCCTGACAGAAAATTTGGATCCTGCACCCATTAAAGATGAATATGCACCAGTAATATTTTCACAAGAAACTGTGTCTCATATTGTTTCTATTGACATGATGTCTGGAAAGGAAGACCGTGAGAACATCCTGCGGGCAAGGGCAACTGGAAAGGGAGTATTGACATCTCCTTTTAAGCTGTTAAAATCCAATCACCTTGGTGTTGTTCTCACATTTGCTGTTTATAACAAGGATTTGCCTCCAAGTGCTACACCAAGGCAACGAACTGAAGCTACTGTGGGGTACCTGGGTGCGTCTTATGATGTCCCCTCTCTGGTGGAGAAGCTTCTGCACCAACTTGCCAGCAAGCAAACCATTGTTGTCAATGTTTACGACACAACCAATGCATCTGCTGCCATCAGCATGTACGGTACTGATGTAACTGATACTGGCCTACTGCATGTCAGTAGCCTTGATTTTGGAGATCCATTAAGGAAGCATGAGATGCACTGCAGGTTCAAGCAAAAACCCCCGTTACCTTGGACAGCAATTAATGCATCAGTAGGAGTCCTAGTTATTACTTTGCTTGTCGGTCATATCTTCCATGCTGCTATATGTCGAATTGCAAAAGTAGAGAATGACTACCGTGAGATGATGGAGCTCAAAGCTCGTGCTGAAGCTGCAGAT
GTGGCCAAATCTCAGTTTCTAGCAACTGTTTCCCATGAGATCAGGACTCCGATGAATGGTGTTTTAGGTATGCTGAAAATGCTGATGGATACAGAGCTTGATGCGATCCAAAGGGACTATGCTGAGACTGCTCATGCTAGTGGGAAAGATCTTATCTCACTGATAAATGAGGTCCTTGATCAGGCTAAGATAGAATCAGGCAGGCTTGAGCTTGAGGATGTGCCCTTTGATCTACGCACTCTTCTTGATAACGTCCTCTCACTTTCCTCAGACAAATCTAATTATAAAGGGATTGAGTTGGCAGTTTATGTATCCGATCGGGTTCCTGAAGTTGTTGTTGGTGATCCCGGGCGGTTTCGGCAAATAATTACAAATCTTGTTGGAAATTCAATTAAGTTCACGCAGGATAAGGGACATATTTTTGTCTCAGTGCATCTGGTAGATGAAGTGAAGGGTGCATTTGATGTGGGAGACAAGGTGCTGCAACCAGGCTTGAACTTAGTTCAAGACATGTCAAGCAAAACATATAATACGTTAAGTGGGTTTCTAGTGGTGGACAGGTGGAGAAGCTGGGAGAACTTTACAATACTAAATGGCAAAGACTCAATGGAGGATCCTGAAAAGATTAAATTACTAGTAACAGTTGAGGACACAGGTGTGGGAATTCGTTTAGATGCACAGGATCGAATTTTCACTCCTTTTGTGCAAGCTGACAGTTCCACTTCACGACATTATGGTGGGACTGGAATAGGATTGAGCATCAGCAAACGTCTTGTACAACTCATGCATGGGGAGATCGGGTTTGTGAGTGAACCTGGCACTGGCAGTACTTTCTCATTCACTGCAGCTTTTGGAAAAGGTGAAGCGAGTTCTCTGGATTCAAAGTGGAAGCAATATGATCCAGTGATTTCGGAGTTCCAAGGTTTGGGAGCACTGATTATTGATAATAGAAGCATCCGAGCTGAGGTTACAAGATACCATCTTCGGAGATTGGGAATATCTGTGGATATAACTTCCAGTATGGAGTTAGCGTACACCTATCTGTCAAGCACTTGTGGCACAAGTGCATTTGCACATTTGGCCATGATTCTTATTGACAAAGATGTTTGGAATCAGGAAACAGTTCTTCAGTTACGATCTTTGCTCAAAGATCATAGGCAAAATGACAGAGTAGATGTTTCGACAAACCTTCCAAAAATTTTTCTCTTGGCTACCTCCATGAGCCCGATTGAGCGCTCCAAGCTTAAGACTGCTGCTTTTGTAGATAATGTGCTGATGAAGCCACTTCGGTTGAGTGTCTTGATTGCCTGTTTCCAAGAAGCCCTTGGAAATGGTAGAAAGGAGCAAGTACATAGAGAGAGAATGTCTACGCTTGGGAGCTTACTACGAGAAAAGCGGATTTTAGTGGTTGATGACAATAAGGTTAACAGAAGAGTGGCAGAAGGTGCTTTAAAGAAATATGGAGCAATTGTTTCCTGTGTGGAAAGAGGCCAGGATGCGCTGCACAAGCTTAAGCCACCCCATAATTTTGATGCTTGCTTCATGGATCTCCAGATGCCAGAAATGGATGGGTTTGAAGCTACTAGGCAAATCCGCTGCGTGGAGAGTGAGGTCAATGAAAAAATTGTTTCTGGAGAAGCATCCATTGAGATGTACGGAAATGTGCATCAATGGCACATTCCAATTTTAGCAATGACAGCTGATGTCATCCAAACTACAAATGAAGAGTGCATGAAATGTGGGATGGATGGCTATGTGTCAAAGCCTTTTGAGGAAGAGCAACTTTATTCAGCTGTTGCAAGTTTTTTTGAGTCTGGTTGA |
Protein: MSCSSGTGNFVKLSRLLGEIRKCALVKMSMNGKLSGSNCRLSANFRLKKAKETMHGPNSFRKWKRNLLFLWLLGFVSTGIIWFFLSFNSVASERNEKSPDSCDEKARILLQHFNVSKNQFHALASFFYESDQIKFLECTRDSGPKKPSSDGIACALKVLCSEHQDLRKQQMWVVRNTELKDQCPVQVENIPSEHDLSLLEHDTLSFVSQIAVSLVSWEHHSGGKNISQRSALGVESKDNCENLSFCMVKGCWLLLVGVILSWKIPGVRLKLWRNRKNEPALLQPVAQQLPLLLQQKQQQTQSPPKGAGKWRKKLLITFVFVGIFTSFWLFWHLNQKIILRREETLANMCDERARMLQDQFNVSMNHVHALAILVSTFHHGKHPSAIDQKTFGEYTERTAFERPLTSGVAYALKVLHSEREQFEKQHGWTIKKMETEDQTLVQDCLTENLDPAPIKDEYAPVIFSQETVSHIVSIDMMSGKEDRENILRARATGKGVLTSPFKLLKSNHLGVVLTFAVYNKDLPPSATPRQRTEATVGYLGASYDVPSLVEKLLHQLASKQTIVVNVYDTTNASAAISMYGTDVTDTGLLHVSSLDFGDPLRKHEMHCRFKQKPPLPWTAINASVGVLVITLLVGHIFHAAICRIAKVENDYREMMELKARAEAADVAKSQFLATVSHEIRTPMNGVLGMLKMLMDTELDAIQRDYAETAHASGKDLISLINEVLDQAKIESGRLELEDVPFDLRTLLDNVLSLSSDKSNYKGIELAVYVSDRVPEVVVGDPGRFRQIITNLVGNSIKFTQDKGHIFVSVHLVDEVKGAFDVGDKVLQPGLNLVQDMSSKTYNTLSGFLVVDRWRSWENFTILNGKDSMEDPEKIKLLVTVEDTGVGIRLDAQDRIFTPFVQADSSTSRHYGGTGIGLSISKRLVQLMHGEIGFVSEPGTGSTFSFTAAFGKGEASSLDSKWKQYDPVISEFQGLGALIIDNRSIRAEVTRYHLRRLGISVDITSSMELAYTYLSSTCGTSAFAHLAMILIDKDVWNQETVLQLRSLLKDHRQNDRVDVSTNLPKIFLLATSMSPIERSKLKTAAFVDNVLMKPLRLSVLIACFQEALGNGRKEQVHRERMSTLGSLLREKRILVVDDNKVNRRVAEGALKKYGAIVSCVERGQDALHKLKPPHNFDACFMDLQMPEMDGFEATRQIRCVESEVNEKIVSGEASIEMYGNVHQWHIPILAMTADVIQTTNEECMKCGMDGYVSKPFEEEQLYSAVASFFESG |
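The Location field and the protein Length field can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it sums the exon spans listed in the `complement(join(...))` location string and compares the total with the 1271 aa protein length plus one stop codon. The helper names `parse_segments` and `cds_length` are hypothetical, and the sketch assumes that coordinates are 1-based and inclusive and that the CDS includes the terminal stop codon.

```python
import re

# Location string copied from the record's Location field.
LOCATION = (
    "complement(join(12958629..12958878,12959320..12959826,"
    "12960451..12961118,12961676..12961774,12961897..12962126,"
    "12962209..12962260,12962439..12962625,12962709..12962921,"
    "12963045..12963214,12965885..12966160,12966428..12966919,"
    "12967667..12967942,12969011..12969406))"
)

PROTEIN_LENGTH_AA = 1271  # from the record's Length field


def parse_segments(location):
    """Return (start, end) integer pairs for every 'start..end' span."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def cds_length(location):
    """Sum span lengths, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in parse_segments(location))


segments = parse_segments(LOCATION)
total_nt = cds_length(LOCATION)

# Assumption: the CDS includes the stop codon, so nt = (aa + 1) * 3.
expected_nt = (PROTEIN_LENGTH_AA + 1) * 3

print(len(segments), "segments,", total_nt, "nt")  # 13 segments, 3816 nt
print("consistent with protein length:", total_nt == expected_nt)  # True
```

The `complement(...)` wrapper only indicates that the gene lies on the reverse strand. It does not change the span lengths, so the sketch ignores it when computing the total.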